Improved features using convolution-augmented transformers for keyword spotting
نویسندگان
چکیده
Transformer can effectively model long rang dependency, but suffer from uncapable to extract local feature patterns. While CNNs exploit features effectively. In this paper, we seek combine convolution and Transformers improves over using them individually, propose improved convolution-augmented transformers for keyword spotting. The are constructed with a ResNet front-end back-end in series. Using spotting task. results show that the convolution- augmented yield at least 3% improvement compared other features.
منابع مشابه
Transferable Deep Features for Keyword Spotting
Deep features, defined as the activations of hidden layers of a neural network, have given promising results applied to various vision tasks. In this paper, we explore the usefulness and transferability of deep features, applied in the context of the problem of keyword spotting (KWS). We use a state-ofthe-art deep convolutional network to extract deep features. The optimal parameters concerning...
متن کاملKeyword Spotting Using Durational Entropy
This paper deals with the task of detection of a given keyword in continuous speech. We build upon a previously proposed algorithm where a modified Viterbi search algorithm is used to detect keywords, without requiring any explicit garbage or filler models. In this work, the concept of durational entropy is used to further discard a large fraction of false alarm errors. Durational entropy is de...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملAssisted keyword indexing for lecture videos using unsupervised keyword spotting
Many students use videos to supplement learning outside the classroom. This is particularly important for students with challenged visual capacities, for whom seeing the board during lecture is di cult. For these students, we believe that recording the lectures they attend and providing e↵ective video indexing and search tools will make it easier for them to learn course subject matter at their...
متن کاملMorphological Segmentation for Keyword Spotting
• We explore the impact of morphological segmentation on Keyword Spotting (KWS). ! • Handling out-of-vocabulary (OOV) words is a major challenge in KWS we aim to alleviate this problem by utilizing sub-word units.! • We augment a state-of-the-art KWS system with subword units derived from supervised and unsupervised morphological segmentations, and compare with phonetic and syllabic segmentatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ITM web of conferences
سال: 2022
ISSN: ['2271-2097', '2431-7578']
DOI: https://doi.org/10.1051/itmconf/20224702039